
A curated dataset of complete Enterobacteriaceae plasmids compiled from the NCBI nucleotide database



Abstract

Thousands of plasmid sequences are now publicly available in the NCBI nucleotide database, but their annotations do not reliably distinguish complete plasmids from plasmid fragments such as gene or contig sequences; retrieving complete plasmids for downstream analyses is therefore challenging. Here we present a curated dataset of complete bacterial plasmids from the clinically relevant Enterobacteriaceae family. The dataset was compiled from the NCBI nucleotide database using curation steps designed to exclude incomplete plasmid sequences and chromosomal sequences misannotated as plasmids. The curated dataset includes over 2000 complete plasmid sequences. Protein sequences produced by translating each complete plasmid nucleotide sequence in all six reading frames are also provided. Further analysis and discussion of the dataset are presented in an accompanying research article: “Ordering the mob: Insights into replicon and MOB typing…” (Orlek et al., 2017) [1]. The curated plasmid sequences are publicly available in the Figshare repository.
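As an illustration of what translation in all six frames means, the following is a minimal Python sketch, not the authors' pipeline (the abstract does not say which tool they used). It assumes the standard codon assignments, which the bacterial genetic code (NCBI translation table 11) shares, and it treats the sequence as linear, so codons spanning the origin of a circular plasmid are not handled. The function names and the frame labels `+1` to `-3` are hypothetical choices made for this example.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: suitable for bacterial plasmids, since translation table 11
# differs from the standard table only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate a DNA sequence codon by codon from its first base.

    Stop codons become '*'; codons with ambiguous bases become 'X'.
    A trailing partial codon is ignored.
    """
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(seq):
    """Translate both strands at offsets 0, 1 and 2 (linear sequence only)."""
    frames = {}
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(strand_seq[offset:])
    return frames


if __name__ == "__main__":
    # Toy sequence for illustration only, not taken from the dataset.
    for frame, protein in six_frame_translation("ATGGCCTAA").items():
        print(frame, protein)
```

For the toy sequence `ATGGCCTAA`, frame `+1` reads the codons ATG, GCC and TAA and gives `MA*`. For real plasmid sequences an established library or tool would normally be preferable to hand-rolled code.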
